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Abstract 

This paper initiates a study into the century-old issue of market predictability from the 
perspective of computational complexity. We develop a simple agent-based model for a stock 
market where the agents are traders equipped with simple trading strategies, and their trades 
together determine the stock prices. Computer simulations show that a basic case of this 
model is already capable of generating price graphs which are visually similar to the recent 
price movements of high tech stocks. In the general model, we prove that if there are a large 
number of traders but they employ a relatively small number of strategies, then there is a 
polynomial-time algorithm for predicting future price movements with high accuracy. On the 
other hand, if the number of strategies is large, market prediction becomes complete in two new 
computational complexity classes CPP and BCPP, where pNP[0(io g ™)] g B cpp c CPP = PP. 
These computational completeness results open up a novel possibility that the price graph of an 
actual stock could be sufficiently deterministic for various prediction goals but appear random 
to all polynomial-time prediction algorithms. 



1 Introduction 

The issue of market predictability has been debated for more than a century (see |7j for earlier 
papers and |5|, 12, Iq, 17| for more recent viewpoints). In 1900, the pioneering work "Theory of 
Speculation" of Louis Bachelier used Brownian motion to analyze the stochastic properties of 
security prices |7j. Since then, Brownian motion and its variants have become textbook tools for 
modeling financial assets. Relatively recently, the radically different methodology of Mandelbrot 



used fractals to approximate price graphs deterministically [18]. In this paper, we initiate a study 



into this long-running issue from the perspective of computational complexity. 
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We develop a simple agent-based model for a stock market @,[T^1- The agents are traders 
equipped with simple trading strategies, and their trades together determine the stock prices. We 
first consider a basic case of this model where there are only two strategies, namely, momentum 
and contrarian strategies. The choice of this base model and thus our general model is justified at 
two levels: (1) Experimental and empirical studies in the finance literature [fL], 3, |6[ PHTl , 14] show 
that a large number of traders primarily follow these two strategies. (2) Our own simulation results 
show that despite its simplicity, the base model is capable of generating price graphs which are 
visually similar to the recent price movements of high tech stocks (Figures |] and |2|) . 

With these justifications, we then consider the issue of market predictability in the general 
model. We prove that if there are a large number of traders but they employ a relatively small 
number of strategies, then there is a polynomial-time algorithm to predict future price movements 
with high accuracy (Theorem ||). On the other hand, if there are also a large number of strategies, 
then the problem of predicting future prices becomes computationally very hard. To describe this 
hardness, we define two new computational complexity classes called CPP and BCPP (Definitions 
H and |9|). We show that some market prediction problems are complete for these two classes 
(Theorems [R] and [L7|) and that pNP[0(io g n)] Q BCpp g cpp _ pR 

These computational completeness results open up the possibility that the price graph of an 
actual stock could be sufficiently deterministic for various prediction purposes but appear random 
to all polynomial-time prediction algorithms. This is in contrast to the most popular academic 
belief that the future price of a stock cannot be predicted from its historical prices because the 
latter are statistically random and contain no information. This new viewpoint also differs from 
the fractal-based methodology in that the price graph of a stock could be a fractal but the fractal 
might not be computable in polynomial time. The findings in this paper can by no means settle 
the debate on market predictability. Our goal is only that this alternative approach could provide 
new insights to the predictability issue in a systematic manner. In particular, it could provide a 
general framework to investigate the many documented technical trading rules p(| and to generate 
novel and significant interdisciplinary research problems for computer science and finance. 

The rest of the paper is organized as follows. Section ^ discusses the basic market model. Sec- 
tion H formulates the general model. Section [| proves the complexity results for market prediction 
in the general model. We conclude the paper with some directions for future research in Section ||. 



2 A Basic Market Model 

In this section, we present a very simple market model, called the deterministic- switching MC 
(DSMC) model. The letter M stands for a momentum strategy, and the letter C for a contrarian 



strategy. These two strategies and the model itself are defined in Section 2.1. Some computer 



simulations for this model are reported in Section 2.2, 

Intuitively, these strategies are heuristics ( "rules of thumb" ) used by traders in the absence of 
reliable asset valuation models. As discussed in [11], a momentum trader may observe a sequence 
of "up" trades (price increments) and execute a buy trade in the anticipation that she will not 
be one of the last buyers, knowing very well that the asset is overpriced. Similarly, she may see 
some "down" trades (price decrements) and then make a sell trade in the hope that there will be 
more sellers after her. In contrast, after detecting a number of "up" (respectively, down) trades, a 
contrarian trader may submit a sell (respectively, buy) trade, anticipating a price reversal. 

Both experimental and empirical studies have shown that traders look at past price dynamics 
to form their expectations of future prices, and a large number of them primarily follow momentum 
or contrarian strategies [l|, 1C]. In addition, the traders may switch between these two diamet- 
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rically opposite strategies. Momentum and contrarian strategies are dominant in the behavior of 
professional market timers as well [|14|] . The use of momentum and contrarian strategies sometimes 
signifies gambling tendencies among traders [||. In fact, a market model with momentum and 
contrarian traders can also be interpreted as a market with noise traders and rational traders, 
where the noise traders essentially follow a momentum strategy while the rational traders attempt 
to exploit the noise traders by following a contrarian strategy ||], |ll|] . 

2.1 Defining the DSMC Model 

In the DSMC model, there is only one stock traded in the market. The model is completely 
specified by three integer parameters m,L,k > 0, and a real parameter a > as follows. 

There are m traders in the market, and each trader's strategy set consists of momentum {M) 
and contrarian (C) strategies. At the beginning of day 1 of the investment period, each trader 
randomly chooses her initial strategy from {A4,C} and an integer li £ [2, L] with equal probability, 
where L is the maximum strategy switching period. This is the only source of randomness in the 
DSMC model; from this point onwards, there is no random choice. 

Rule 1 (Deterministic Strategy Switching Rule) For days 1, . . . , k + 1, there is no trading. 
Each trader starts trading from day k + 2 using her initial strategy. Trader i uses the same strategy 
for li days and switches it at the beginning of every li days. 

The next rule defines the two strategies with respect to a given memory size k, which is the 
same for all traders. 

Rule 2 (Trading Rule) At the beginning of day t, observe the stock prices Pt of days / £ 
[t — (k + 1), t — 1]. For g £ [t — k,t — 1], count the number k u of days g when P g > P g ~i; and 
the number of days when P g < P g -\. The k-day trend is defied as Tr(fc, t) = k u — k^. Then, if 
Tr(k,t) > (respectively, < 0), the momentum strategy M. buys (respectively, sells) one share of 
the stock at the market price determined by Rule ^ below. In contrast, the contrarian strategy C 
sells (respectively, buys) one share of the stock. 

For instance, suppose that k = 2, and investor i picks her initial strategy Ai and li = 2 at the 
beginning of day 1. She then observes the prices of days 1, 2, 3, which are, say, $80, $82, $90. At 
the beginning of day 4, she issues a market order to buy one share of the stock. The orders issued 
by the traders on day 4 together determine the price of day 4 as specified by Rule ||. Suppose that 
the price of day 4 is $91, then investor i issues another market buy order at the beginning of day 
5. Since her li is 2, at the beginning of day 6, she switches her strategy from M to C. 

Rule 3 (Price Adjustment Rule) The prices for days 1, . . . , k + 1 are given. On day t > k + 2, 

let mb and m s be the total numbers of buys and sells, respectively. Then, the price Pt on day t is 
determined by the following equation: 

Pt ~ Pt-i = a-(m b - m s ), 

where a is the unit of price change. 
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O 50 100 150 200 250 

Figure 1: A one-year price sequence generated using the DSMC model. Parameters: number of 
traders m = 20, memory size k = 2, maximum strategy switching period L = 8, unit of price 
change a = 0.25, number of trading days = 250. The price graph appears strikingly similar to the 
recent price movements of high tech stocks. 
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Figure 2: A one-year price sequence generated using the DSMC model. The parameters are the 
same as those for Figure |l[ 
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2.2 Computer Simulation on the DSMC Model 



We have conducted some computer simulations of the DSMC model to test whether it can generate 
realistic price graphs. Because we had to examine the graphs visually, our time constraints limited 
the number of these simulations to only about six hundred. For a large fraction of them, we set 
m = 20, L = 8, and the initial k prices in the range of $70 to $90. We then focused on testing the 
effect of memory size k [p^j] . Two main findings are as follows: 

• For k = 1, the price graphs were not visually real. 

• For k = 2, about one out of four graphs were strikingly similar to those of recent high tech 
stocks, which was a major positive surprise to us. Two representatives of such graphs are 
shown in Figures [l] and ^. 

These two statements are based on our subjective impressions and limited simulations. To 
further understand the DSMC model, it would be useful to automate statistical analysis on the 
price graphs generated by this model and compare them with real stock prices. 



3 A General Market Model 

In this section, we define a market model, called the AS model, where the word AS stands for 
arbitrary strategies. It can be verified in a straightforward manner that the DSMC model is a 
special case of the AS model. 

In the AS model, there is only one stock traded in the market. The model is completely specified 
as follows with five parameters: (1) the number m of traders, (2) a unit a > of price change, (3) 
a set II = {S 1 , . . . ,S h } of strategies, (4) a price adjustment rule (Equation [l] or ^ below), and (5) 
a joint distribution of the population variables X\, . . . , X^. 

Rule 4 (Market Initialization) There are m traders in the market. At the beginning of day 1 
of the investment period, each trader randomly chooses her initial strategy from II. Let Xi be the 
number of traders who choose S l . Then, each Xi is a random variable, which is the only source 
of randomness in the model. (Unlike the DSMC model, because the allowable generality of II, the 
AS model does not need strategy switching.) 

Different joint distributions of the variables Xi lead to different specific models and prediction 



problems. In Section we consider joint distributions that tend to Gaussian in the limit as the 
number m of traders becomes large. In Section |4.3| , we consider the case where the variables Xi 
are independent, and each is or 1 with equal probability. 

Rule 5 (Trading Strategies) There is no trading on day 0. At the beginning of day t > 1, a 
trader observes the historical prices Po, . . . , Pt-\ and reacts by issuing a market order to buy one 
share of the stock, hold (i.e., do nothing), or sell one share according her strategy. Formally, a 
strategy is a collection of functions S = {S\,S2, ...,<%,...}, where each St maps Po, . . . , Pt-i to 
+1 (buy), (hold), or -1 (sell). 

The price Pt of day t is determined at the end of the day by the day's m market orders 
using Rule §. Since the traders choose their strategies randomly, the sequence Pq, P\, . . . , Pt, . . . 
is a stochastic process. We write Tt for the probability space induced by all possible sequences 



(Po, . . . , Pt) [13]. Then, we think of each function St as a random variable on J~t-i- 



We distinguish between strategies that react to price movements and those that ignore them. 
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• S is an active strategy if the functions St may or may not be constant functions. An active 
trader is one with an active strategy. 

• S is a passive strategy if the functions St all are constant functions. A passive trader is one 
with a passive strategy. 

Rule 6 (Price Adjustment) The price Po is given. At the end of day t > 1, the price Pt is 
determined by the day's market orders to buy or sell from the traders. We consider two simple 
rules: 

With the proportional increment (PI) rule, 

h 

P t = P t ^ + a-J2X t -Si, (1) 

i=i 

where a is the unit of price change. Thus we can observe directly the net difference between the 
number of buyers and sellers on day t. 
With the fixed increment (FT) rule, 

Pt = P t -i + a- sign (j^ XiS 1 ^ . (2) 

In this case, the market moves up or down depending on whether the majority of traders are buying 
or selling, but the amount by which it moves is fixed at a. 

For notational brevity, an AS+FI model refers to an AS model with the fixed increment rule, 
and an AS+PI model refers to an AS model with the proportional increment rule. 

In reality, the price tends to move up if there are more buy orders than sell orders; similarly, 
the price tends to move down if there are more sell orders than buy orders. The FI rule is meant to 
model the sign but not the magnitude of the slope of this correlation, while the PI rule attempts to 
model both. Clearly, there can be many other increment rules, which this paper leaves for future 
research. 



4 Predicting the Market 

Informally, the market prediction problem at the beginning of day t is defined as follows: 

• The data consists of (1) the five parameters of an AS-model, i.e., m, a, IT, Xj, and a price 
adjustment rule, and (2) a price history Pq, . . . , Pt-i- 

• The goal is to predict the price Pt by estimating the conditional probabilities Pr[Pt > Pt-\ \ 
P , . . . , P t -!}, Pr[P t < P t _! | P , . . • , P t -i], and Pr[P = P t _ x | P , . . . , P t _i]. 

Note that Pr[P t > P t _i | P , . . . , P t -{\ is symmetric to Pr[P < P t _i | P , . . . , P t _i] and Pr[P t = 
P t -i | P , . . . , P-i] = 1 - Pr[P t > P t -i | P , . . . , P t -i] - Pr[P t < P t -i | P , . . . , P t -i}. Thus, from 
this point onwards, our discussion focuses on estimating Pr[P t > P t -\ \ Pq, . . . , Pt-i]- 

From an algorithmic perspective, we sometimes assume that the price adjustment rule and the 
joint distribution of the variables Xj are fixed, and that the input to the algorithm is m, a, a 
description of II, and the price history. This allows different algorithms for different model families 
as well as side-steps the issue of how to represent the possibly very complicated joint distribution of 
the variables Xj as part of the input. As for the description of II, we only need S{, . . . ,S\ for each 
S % 6 II instead of the whole II, and the description of these functions can simplified by restricting 
their domains to consist of the price sequences consistent with the given price history. 
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4.1 Markets as Systems of Linear Constraints 

In the AS+FI model with parameters m and a, a price sequence Pq, ■ ■ ■ ,Pt and II can yield a set 
of linear inequalities in the population variables X{ as follows. If the price changes on day t, we 
have 

h 

signCPt-Pt-iJ^S^X). (3) 

i=l 

If the price does not change, we have instead the equation 

8=1 

Furthermore, any assignment of the variables Xi that satisfies either inequality is feasible with 
respect to the corresponding price movement on day t. In both cases, S\ is computable from the 
price sequence Pq, . . . , Pt—i- The same statements hold for days 1, . . . , t — 1. Therefore, given m 
and a, we can extract from II and Pq, ■ ■ ■ ,Pt a set of linear constraints on the variables Xi. The 
converse holds similarly. We formalize these two observations in Lemmas [l] and || below. 

Lemma 1 In the AS+FI model with parameters m and a, given II and a price sequence Po, . . . , Pp, 
there are matrices A and B with coefficients in { — 1,0, +1}, h columns each, and rows in total. 
The rows of A (respectively, B) correspond to the days when Pj ^ Pj-i (respectively, Pj = Pj-i)- 
Furthermore, the column vectors x = (X\, . . . , Xh) T consistent with II and Pq, . . . ,Pg are exactly 
those that satisfy Ax > and Bx = 0. The matrices A and B can be computed in time 0{h(3T), 
where T is an upper bound on the time to compute a single <5j from Pq, . . . , Pp over all j G [1, f3] 
and S l . 

Proof: Follows immediately from Equations |3] and |||. I 

Lemma 2 In the AS+FI model with parameters m and a, given a system of linear inequalities 
Ax > 0,Bx = 0, where A and B have coefficients in {— 1,0, +1} with h columns each, and (5 rows 
in total, there exist (1) a set U of h strategies corresponding to the h columns of A and B, and 
(2) a ((3 + I) -day price sequence Pq, . .. ,Pp with the latter f3 days corresponding to the (3 rows of 
A and B. Furthermore, the values of the population variables X\, . . . ,X n are feasible with respect 
to the price movement on day j if and only if column vector x = {X\, . . . ,X n ) T satisfies the j-th 
constraint in A and B. Also, Pq, . . . ,Pr and a description of II can be computed in O{h0) time. 

Proof: Follows immediately from Equations ||] and |||. I 

In the AS+PI model we obtain only equations, of the form: 

J2siX i = ^(P t -P t ^). (5) 

i=l 

In this case there is a direct correspondence between market data and systems of linear equations. 
We formalize this correspondence in Lemmas || and ^ below. 

Lemma 3 In the AS+PI model with parameters m and a, given II and a price sequence Pq, . . . , Pa, 
there is a matrix B with coefficients in { — 1, 0, +1}, h columns, and (3 rows, and a column vector b 
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of length h, such that the column vectors x = {X\, . . . , Xh) consistent with II and Pq, . . . , Pp are 
exactly those that satisfy Bx = b. The coefficients of B and b can be computed in time 0{h(3T), 
where T is an upper bound on the time to compute a single Sj from Pq, . . . , Pp over all j G [1, f3] 
and S l . 

Proof: Follows immediately from Equation ||. I 

Lemma 4 In the AS+PI model with parameters m and a, given a system of linear equations 
Bx = b, where B is a (3 x h matrix with coefficients in {— 1,0, +1}, there exist (1) a set H of h 
strategies corresponding to the h columns of B, and (2) a (fi + l)-day price sequence Po, . . . ,Pp 
with the last (3 days corresponding to the (3 rows of B. Furthermore, the values of the population 
variables X\, . . . , X n are feasible with respect to the price movement on day j if and only if column 
vector x = (X\, . . . , X n ) T satisfies the j-th constraint in B. Also, Pq, . .. ,Pp and a description of 
II can be computed in 0{hj3) time. 

Proof: Follows immediately from Equation ||. I 



4.2 An Easy Case for Market Prediction: Many Traders but Few Strategies 

In Section p~2l] , we show that if an AS+FI market has far more traders than strategies, then it 
takes polynomial time to estimate the probability that the next day's price will rise. In Section 
4.2.2, we discuss why the same analysis technique does not work for an AS+PI market. 



4.2.1 Predicting an AS+FI Market 

For the sake of emphasizing the dependence on m, let Pr m [£'] be the probability that event E 
occurs when there are m traders in the market. 
This section makes the following assumptions: 

El The input to the market prediction problem is simply a price history Pq, . . . , Pt-\- The output 
is lim^oo Pr m [P t > P t _i \ P , . . . , P t _i]. 

E2 The market follows the AS+FI model. 

E3 II is fixed. The values 5j over all i S [l,h] are computable from the input in total time 
polynomial in j. 

E4 Each of the m traders independently chooses a random strategy S l from II with fixed probability 
Pi > 0, where pi H \-Ph = 1- 

The parameter a is irrelevant. 

Notice that the column vector X = (X\, . . . , Xh) T is the sum of m independent identically- 
distributed vector-valued random variables with a center at p = m-(pi, . . . ,Ph) T ■ We recenter 
and rescale X to Y = (X — m-(pl, . . . ,ph) T )/^/m. Then, by the Central Limit Theorem (see, 
e.g., Theorem 29.5]), as m — > +oo, Y converges weakly to a normal distribution centered at 
the /i-dimensional vector (0, . . . ,0) T . In Theorem || below, we rely on this fact to estimate the 
probability that the market rises for price histories that occur with nonzero probability. 
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Theorem 5 Assume that linim^oo Pr m [Po, ■ • • , -Ft— 1] > 0. Then there is a fully polynomial-time 
approximation scheme for estimating linim^oo Pr m [P t > Pt-i \ Pq, . . . , Pt—i] from Po, . . . , Pt-\- 
The time complexity of the scheme is polynomial in (1) the length t of the price history, (2) the 
inverse of the relative error bound e, and (3) the inverse of the failure probability rj. 

Remark. We omit the explicit dependency of the running time in h and pi, . . . ,ph in order to 
concentrate on the main point that market prediction is easy with this section's four assumptions. 
The parameters h and Pi, ■ ■ ■ ,Ph are constant under the assumptions. 

Proof: We use Lemma [l] to convert the price history Pq, . . . , Pj_i and the strategy set II into 
a system of linear constraints AX > and BX = 0, with the next day's price change Pt — Pt-i 
determined by sign(c-X') for some c. Since the values <Sj are computable in time polynomial in j, 
this conversion takes time polynomial in t. 

Then, Pr m [P , • • • , Pt-i] = Px m [AX > OABX = 0]. Since limbec Pv m [AX > OABX = 0] > 0, 
the constraints in B must be vacuous; in other words, for each Pj = with i £ [0, t — 1], the 
corresponding constraint in B is 0-Xi + • • • + 0-Xh = 0. Therefore, Pr m [Po, . . . , P*-i] = Pr m [AX > 
0]. Furthermore, since both A and c are constant with respect to m, 

1,™ Pr \p > P ip P i _ lim m ^ oo Pr m [^X>0A C -X>0] 
hm Pr m [P t > P t _ x | P , . . .,P t _ x ] lim _ ooPrm[ylx>0] • ( 6 ) 



So to compute the desired linim^oo Pr m [P t > Pt-i \ Pq, . . . , Pt-i], we compute linim^oo Vv m [AX > 
A c-X > 0] and lim m ^oo Pi m [AX > 0] as follows. 

To avoid the degeneracy caused by J2i=i %i = m , we work with X' = (X±, . . . , Xh-i) T instead 
of X by replacing Xh with m — J2i=i an d making related changes. Let p' = (p±, . . . ,Ph-i) T , 
which is the center of X' . As is true for Y, as m — > +oo, the vector Y' = (X' — m-p')/^/m 
converges weakly to a normal distribution centered at the (h — l)-dimensional point (0, . . . , 0) T . 
Under the assumption that each pi is nonzero, the distribution of Y' is full-dimensional (within 
its restricted (h — l)-dimensional space), as in the limit the variance of each coordinate Y( is 
nonzero conditioned on the values of the other coordinates, which implies that the smallest subspace 
containing the distribution must contain all h — 1 axes. We can calculate the covariance matrix of 
Y' directly from the pi, as it is equal to the covariance matrix for a single trader: on the diagonal, 
Cu = Pi — pf] and for off-diagonal elements, Cjj = —piPj. Given C, Y' has density p(x) = ae x 
for some constant a, and we can evaluate this density in 0(h 2 ) time given x, which is O(l) time 
under our assumption that IT is fixed. 

Let Ai be the i-th constraint of A, i.e., Ai^X\ + • • • + A^X^ > 0. Let A' i denote the constraint 
(A it x - A i>h , A i>h _i - A i)h ). Let d = (ci - c h , . . . , c h _ x - c h ). 

We next convert the constraints of A on X into constraints on Y'. First of all, notice that 
AiX = ^/m^A'jY') + m-Aip. So AiX > if and only if A[Y' > -Jm-Aip. The term -y/m-Aip 
may not be constant. In such a case, as m — > oo, the hyper plane bounding the half space 
A'jY' > —^/m-Aip keeps moving away from the origin, which presents some technical complication. 
To remove this problem, we analyze the term in three cases. If Aip < 0, then since m-p is the 
center of X, as m — > oo, Pr m L4jX < 0] converges to 1. In other words, A^ is infeasible with 
probability 1 in the limit. Then, since linim^oo Pr m [Po, . . . , Pt— i] > 0, such Ai cannot exist in 
A. Similarly, if A^p > 0, then lmim^oo Pr m [AiX > 0] = 1 and Ai is vacuous. The interesting 
constraints are those for which Aip = 0; in this case, by algebra, AiX > if and only if A'^Y' > 0. 
Thus, let D be the matrix formed by these constraints; D can be computed in 0(ht) time. Then, 
since D is constant with respect to m, lim m ^oo Pr m [AX > 0] = linim^oo Pr m [DY 1 > 0]. Similarly, 
Pr rn [AX > A c-X > 0] converges to (1) 0, (2) Vr m [DY' > 0], or (3) Pr m [DY' > A c'-Y' > 0] for 
case (1) c-p < 0, case (2) c-p > 0, or case (3) op = 0, respectively. 
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Therefore, by Equation ^, lim m _ )00 Pr m [Pt > Pt-i \ Pq, ■ ■ ■ ,Pt-i] equals for case (1) and 
equals 1 for case (2). Case (3) requires further computation. 

]• Pr rp . p |p p i _ lim^ooPr m [ZJr > A c'-F' > 0] 
lim Jr"r m [f t > ft-i fo, ■ ■ ■ , ft-l\ — - j— — -j . (7) 



in 



The numerator and denominator of the ratio in Equation [7| are both integrals of the distribution 
of Y' in the limit over the bodies of possibly infinite convex polytopes. To deal with the possible 
infiniteness of the convex bodies DY' > A c'-Y' > and DY' > 0, notice that the density 
drops exponentially. So we can truncate the regions of integration to some finite radius around 
the (h — l)-dimensional origin (0, . . . ,0) T with only exponentially small loss of precision. Finally, 
since the distribution of Y' in the limit is normal, by applying the Applegate-Kannan integration 
algorithm for log-concave distributions |§] to the numerator and denominator separately, we can 
approximate limm^oo Pr m [P t > Pt-\ \ Pq, • • • , Pt-l] within the desired time complexity. I 



4.2.2 Remarks on Predicting an AS+PI Market 

The probability estimation technique based on taking mtooo does not appear to be applicable to 
the AS+PI model for the following reasons. 

First of all, by Lemma ||, the input price history induces a system of linear equations BX = b. 
If any equation in BX = b is not equivalent to X\ + • • • + Xh = m or O-Xi + • • • + O-Xh = 0, then 
lim m _oc Pr m [P , • • • , Pt-l] = 0. 

A natural attempt to overcome this seemingly technical difficulty would be to (1) solve BX = b 
to choose a maximal set U of independent variables Xi and (2) evaluate Pr m [Po, • • • , Pt-i] i n the 
probability space induced by this set. Still, a single constraint such as Bi^-X\+- ■ •+Bi t f l -Xf l = a-mo 
with Bij > for all j E [1, h] and B it j/ > for some Xf £ U forces linim^oo Pr m [Po 5 • • • , Pt-l] = 
in the new probability space. This is due to the fact that rriQ is constant with respect to m. 

A further attempt would be to evaluate lmim^oo Pr m [Pt > Pt-i \ Po, . . . , Pt-i] by directly 
working with the probability space induced by Po, . . . , Pf—i- This also does not work because we 
show below that the market prediction problem can be reduced to the case where taking a limit 
in m has no effect on the distribution of the strategy counts. Suppose that we are given a market 



which follows the assumptions El, E3, and E4 of Section 4.2.1 except that this market uses the PI 
rule and has niQ traders. We construct a new market with any m > niQ traders with the following 
modifications: 

1. The price history Pq, . . . , Pt-i is extended with an extra day into Pq, . . . , P^, where 
Pj = Pj for < j < t — 1. Each strategy Si is extended into a new strategy S[ where (1) on 
day j £ [1, t - 1], S'^Pq, P,_i) = 5j(P , . . . , Pj-i), (2) on day t, S[ always buys, and (3) 
on day t + 1, S^, . . . , P{) = ^(Po, . . . , P t -x). Thus, P{ = P[_ x + a-m . 

2. Add a passive strategy <5^_j that always holds. 

3. Let p[ = ^pi for 1 < i < h and p' h+1 = \- 

Note that since P/ — P/_ x = a-rriQ, m — niQ traders choose the passive strategy Sh+i- Also, the 
new market and the new price history can accommodate any m > tuq traders. Note that because 
of the constraint P[ — P[_\ = a-mQ, the probability distribution of (X\, ■ ■ ■ , X^) T conditioned 
on Pq, . . . ,P[ in the new market for each m > mo is identical to the probability distribution of 
(Xi, ■ ■ ■ , Xh) T conditioned on Po,...,Pt_i in the original market with m = tuq. Furthermore, 
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Pr m [P/ +1 > P( | Pq,..., P[] = Pr mo [P t > P t -i | Pq, . . . , Pt-i]- So we have obtained the desired 
reduction. 

Consequently, we are left with a situation where the number of active strategies may be com- 
parable to the number of traders. Such a market turns out to be very hard to predict, as shown 



next in Section 4.3. 



4.3 A Hard Case for Market Prediction: Many Strategies 



Section 4.2 shows that predicting an AS+FI market is easy (i.e., takes polynomial time) when the 
number m of traders vastly exceeds the number h of strategies. In this section, we consider the case 
where every trader may have a distinct strategy, and show that predicting an AS+FI or AS+PI 
market becomes very hard indeed. 

We now define two decision-problem versions of market prediction. Both versions make the 
following assumption: 

• Each Xi is independently either or 1 with equal probability. 
The bounded market prediction problem is: 

• Input: a set of n passive strategies and a price history spanning n days such that the proba- 
bility that the market rises on day n+1 conditioned on the price history is either (1) greater 
than 2/3 or (2) less than 1/3. 

• Question: Which case is it, case (1) or case (2)? 
The unbounded market prediction problem is: 

• Input: a set of n passive strategies and a price history spanning n days. 

• Question: Is the probability that the market rises on day n + 1 conditioned on the price 
history greater than 1/2 (without the usual e term)? 

The unbounded market prediction problem has less financial payoff than the bounded one due 
to different probability thresholds. For each of these two problems, there are in effect two versions, 
depending on which price increment rule is used; however, both versions turn out to be equally 
hard. These two problems can be analyzed by similar techniques, and our discussion below focuses 
on the bounded market prediction problem with a hardness theorem for the unbounded market 



prediction problem in Section 4.3.4 



We show in Section 4.3.1 how to construct passive strategies and price histories such that 
solving bounded market prediction is equivalent to estimating the probability that a Boolean 
circuit outputs 1 on a random input conditioned on a second circuit outputting 1. In Section 



4.3.2, we show that this problem is hard for p Np [°( lo g n )] an( j complete for a class that lies between 



pNFfo(iogn)] an( j pp Thus bounded market prediction is not merely NP-hard, but cannot be solved 
in the polynomial-time hierarchy at all unless the hierarchy collapses to a finite level. 

4.3.1 Reductions from Circuits to Markets 

Lemma ^ converts a circuit into a system of linear inequalities, while Lemma converts a system 
of linear inequalities into a system of linear equations. These systems can then be converted into 
AS+FI and AS+PI market models using Lemmas § and 0, respectively. 
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Note that the restriction in Lemma ^ to circuits consisting of 2-input NOR gates is not an 
obstacle to representing arbitrary combinatorial circuits (with constant blow-up), as 2-input NOR 
gates are universal. 

Lemma 6 For any n-input Boolean circuit C consisting of m 2-input NOR gates, there exists a 
system Ax > of 3m + 2 linear constraints inn + m + 2 unknowns and a length n + m + 2 column 
vector c with the following properties: 

1. Both A and c have coefficients in { — 1,0, +1} that can be computed in time 0((n + m) 2 ). 

2. Any 0-1 vector {x\ , . . . , x n ) has a unique 0-1 extension x = {x\, . . . , x n , x n+ i, . . . x n + m +2) 
satisfying Ax > 0. 

3. If Ax > 0, then cx > if and only if C(x±, X2, ■ ■ ■ , x n ) = 1. 

Proof: Let x n+ k represent the output of the k-th NOR gate, where 1 < k < m. Without loss 
of generality we assume that gate m is the output gate. 

The variables x n+m+ i and x n+m +2 are dummies to allow for a zero right-hand-side in Ax > 0; 
our first two constraints are x n+m+ i > and x n+m+ 2 > 0. 

Suppose gate k has inputs Xi and Xj. The NOR operation is implemented by the following 
three linear inequalities: 

X{ -f- 3?n+fc < 2, 

Xj -\- x n +k < 2, 

Xi -\- Xj -\- X n J r ^ !> 0. 

The first two constraints ensure that the output is never 1 if an input is 1, while the last requires that 
the output is 1 if both inputs are 0; the constraints are thus satisfied if and only if x n+ k = ~^{xi Vxj). 
Using the dummy variables, the first two constraints are written as 

Xi 3?n+fc + X n -\- m +i + X n j rm j r 2 !> 0, 

Xj X n _)_fc + X n -\- m +i + X n s rm -\-2 > 0. 

Let Ax > be the system obtained by combining all of these inequalities. Then for each 
(x\, . . . , x n ), Ax > determines x n +fc for all k > 1. The vector c is chosen so that CX — Xn-\-m,' ^ 

One might suspect that the fixed increment rule's ability to hide the exact values of the left-hand 
side of each constraint is critical to disguise the inner workings of the circuit. However, by adding 
slack variables we can translate the inequalities into equations, allowing the use of a proportional 
increment rule without revealing extra information. 

Lemma 7 Let Ax > be a system of m linear inequalities in n variables where A has coefficients 
in { — 1, 0, +1}. Then there is a system By = 1 of mn — m+ 1 linear equations in 2mn — 3m + n+ 1 
variables with the following properties: 

1. B has coefficients in {— 1,0, +1} that can be computed in time 0((mn) 2 ). 

2. There is a bijection f : x i— > y between the 0-1 solutions x to Ax > and the 0-1 solutions y 
to By = 1, such that Xj = yj for 1 < j < n whenever y = f(x). 
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Proof: For each 1 < i < m, let Ai be the constraint J2j ^ij x j > 0- To turn these inequalities 
into equations, we add slack variables to soak up any excess over 1, with some additional care taken 
to ensure that there is a unique assignment to the slack variables for each setting of the variables 

Xj. 

We will use the following 0-1 variables, which we think of as alternate names for y\ through 

y2mn—3m+n+l ■ 



Variables 


Purpose 


Indices 


Count 


Xj 


original variables 


1 < j < ri 


n 


u 


constant 1 


none 


1 


Sij 


slack variables for 


1 < i < m,l < j < n — 1 


m{n — 1) 


Hj 


slack variables for > Sij+i 


1 < i < m,l < j < n — 2 


m(n — 2) 



Name 


Equation 


Purpose 


Indices 


Count 


U 


u = 1 


set u 


none 


1 


Bi 


J2j AijXj — J2j s ij = i 


represent Ai 


1 < i < m 


m 


Sij 


Sij — 1 


require > Sij+i 


1 < i < m, 1 < j < n — 2 


m(n — 2) 



Observe that for each i, J2j s ij can t a ^e on any integer value between and n — 1, and that 
for any fixed value of <7j, the Sij constraints uniquely determine the values of and ijj for all j. 
So each constraint Bi permits \i = J2j AijXj to take on precisely the same values 1 to n that Ai 
does, and each \i uniquely determines Oi and thus the assignment of all and tij. I 



4.3.2 Conditional Probability Complexity Classes 

Suppose that we take a polynomial-time probabilistic Turing machine, fix its inputs, and use the 
usual Cook's Theorem construction to turn it into a circuit whose inputs are the random bits used 
during its computation. Then, we can feed the resulting circuit to Lemmas || and || to obtain an 
AS+FI market model in which there is exactly one assignment of population variables for each set 
of random bits, and the price rises on the last day if and only if the output of the Turing machine is 
1. By applying Lemma to the intermediate system of linear inequalities, we can similarly convert 
a circuit to an AS+PI model. It follows that bounded market prediction is BPP-hard for either 
model. But with some cleverness, we can exploit the conditioning on past history to show that 



bounded market prediction is in fact much harder than this. We do so in Section 4.3.3, after a 
brief detour through computational complexity in this section. 

We proceed to define some new counting classes based on conditional probabilities. One of 
these, BCPP, has the useful feature that bounded market prediction is BCPP-complete. We will 
use this fact to relate the complexity of bounded market prediction to more traditional complexity 
classes. 

The usual counting classes of complexity theory (PP, BPP, R, ZPP, C=, etc.) are defined 
in terms of counting the relative numbers of accepting and rejecting states of a nondeterministic 
Turing machine. We will define a new family of counting classes by adding a third decision state 
that does not count for the purposes of determining acceptance or rejection. 

A noncommittal Turing machine is a nondeterministic Turing machine with three decision 
states: accept, reject, and abstain. We represent a noncommittal Turing machine as a deterministic 
Turing machine which takes a polynomial number of random bits in addition to its input; each 
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assignment of the random bits gives a distinct computation path. A computation path is accept- 
ing/rejecting/abstaining if it ends in an accept /reject /abstain state, respectively. We often write 
1, 0, or _L as shorthand for the output of an accepting, rejecting, or abstaining path. 

Conditional versions of the usual counting classes are obtained by carrying over their definitions 
from standard nondeterministic Turing machines to noncommittal Turing machines, with some care 
in handling the case of no accepting or rejecting paths. We can still think of these modified classes 
as corresponding to probabilistic machines, but now the probabilities we are interested in are 
conditioned on not abstaining. 

Definition 8 The conditional probabilistic polynomial-time class (CPP) consists of those languages 
L for which there exists a polynomial-time noncommittal Turing machine M such that x G L if 
and only if the number of accepting paths when M is run with input x exceeds the number of 
rejecting paths. 



Definition 9 The bounded conditional probabilistic polynomial-time class (BCPP) consists of those 
languages L for which there exists a constant e > and a polynomial-time noncommittal Turing 
machine M such that (1) x G L implies that a fraction of at least ^ + e of the total number of 
accepting and rejecting paths are accepting and (2) x ^ L implies that a fraction of at least ^ + e 
of the total number of accepting and rejecting paths are rejecting. 



Definition 10 The conditional randomized polynomial-time class (CR) consists of those languages 
L for which there exists a constant e > and a polynomial-time noncommittal Turing machine 
M such that (1) x £ L implies that a fraction of at least e of the total number of accepting and 
rejecting paths are accepting, and (2) x ^ L implies that there are no accepting paths. 



As we show in Theorems 11 and 12, CPP and CR turn out to be the same as the unconditional 



classes PP and NP, respectively. 



Theorem 11 CPP = PP. 



Proof: First of all, PP C CPP because a PP machine is a CPP machine that happens 
not to have any abstaining paths. For the inverse direction, represent each abstaining path of a 
CPP machine by a pair consisting of one accepting and one rejecting path, and each accepting or 
rejecting path by two accepting or rejecting paths. Then the resulting PP machine accepts if and 
only if the CPP machine does. I 



Theorem 12 CR = NP. 



Proof: To show NP C CR, replace each rejecting path of an NP machine with an abstaining 
path in a CR machine. For the inverse direction, replace each abstaining path of the CR machine 
with a rejecting path in the NP machine. I 

BCPP appears to be a more interesting class. Since it is clearly a subset of CPP, we have: 



Corollary 13 BCPP C PP. 
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Proof: Immediate from Theorem 
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and the definition of BCPP and CPP. I 



On the other hand, BCPP appears to be much stronger than the analogous non-conditional 
class BPP. For example, it is straightforward to show that NP C BPP. Use the representation of 
an NP-machine as a deterministic machine M that takes some polynomial number of "hint" bits 
in addition to its input, and replace these N hint bits with N random bits r. In addition, supply 
another 2N random bits r' , which will be used to amplify the conditional probability of accepting 
paths. Now let M'(x,r,r') accept if M(x,r) accepts; reject if M(x,r) rejects and r' = 0; and 
abstain if M(x,r) rejects and r' ^ 0. Then if M has any accepting path on input x, M' has at 
least 2 2N accepting paths and at most 2^ — 1 rejecting paths, for an exponentially large probability 
of accepting — since we have amplified the small number of accepting paths so that they overwhelm 
the few rejectors. Alternatively, if M(x,r) never accepts, neither does M'. 

By repeating this sort of amplification of "good" paths, we can in fact simulate O(logn) queries 
of an NP-oracle, as stated in the following theorem. 

Theorem 14 pWP[0(io g n)] Q BCPP 

Proof: Let M{x, hi, h^, ■ ■ ■ , h^) be a deterministic implementation a p Np [°( lo g n )] machine, 
where k = O(logra) and each h{ supplies a witness for the i-th oracle query. We will show that the 
language L(M) accepted by M is in BCPP. 

To simplify the presentation, we assume that each oracle query is a Boolean formula with a fixed 
number m of variables, where m is polynomial in n = \x\, and that hi is an assignment for those 
variables. We assume that M consists of a sequence of functions Mi,Mi, . . . for generating 
oracle queries, a set of k verifiers Vi, . . . , 14 for verifying the witnesses hi, and a combining function 
M* that produces the output from the input and the outputs of the Vi . Each function Mi takes as 
input the input \x\ and the outputs of Vi through Vi-i. Vi sees the output of Mi and the input hi. 
M* sees the input \x\ and the outputs of V\ through 14. The output of the combined machine M 
is the output of any computation path where hi is chosen so that V{(Mi, hi) = 1 whenever such an 
hi exists. In other words, we demand that hi be a satisfying assignment when possible, and ignore 
those paths where satisfiable queries are issued but satisfying assignments are not supplied. 

We will represent M with a noncommittal machine M'{x, r%, . . . , r^, r'), where each rj is a 
random bit- vector replacing the corresponding hi, and r' is an extra supply of random bits used to 
amplify the good computation paths to overwhelm the bad ones. This amplification process is a 
little complicated, because it is not enough to amplify paths that find good witnesses to particular 
queries; it may be that a bad witness for an earlier query causes some Mi to issue a different 
query from the correct one. So we must amplify a path that finds a witness to query i enough to 
overwhelm not only the exponentially many invalid witnesses to query i, but also the exponentially 
many valid witnesses that might be returned to instances queries i + 1 through k based on an 
incorrect answer to query i. 

Let v = (vi,V2, ■ ■ ■ ,Vf.) be the vector of outputs of V,. For each v, we define an amplification 
exponent A(v) as follows: 

k 

A{v) =J2vi2 k -\mk + l). 
i=i 

We will write Ai for the coefficient 2 <yk ~ t \mk + 1); these coefficients A4 are chosen to make Equa- 
tion |8] work below. 

Now let M'{x, n, ■ ■ ■ ,r/~,r') = M{x, r%, . . . , r^) whenever r[ = for all i > A(v), where 
v is the output of V\ through Vk in the computation of M. If r ■ / for some i > A(v), 
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M'(x, ri, . . . , rfe, r') =_L. The effect of the r' bits is to set the weight of each non-abstaining 
path to 2 A K 

Clearly M' can be computed in polynomial time as long as \r'\ is polynomial. The number of 
r' bits needed is the maximum value of A(v), which is (mk + l)(2 k+1 - 1) = 0(n c k2 h ). For this to 
be polynomial, we need k = O(logn). 

A good path p is precisely one for which each Vi is the correct output of the NP-oracle. A bad 
path p' is one in which one or more of the v\ values is incorrect. We will match bad paths with 
good paths, and show that the weight of each good path is much larger than the total weight of 
all bad paths mapped to it. 

Identify each path with the sequence r±, . . . , that generates it. Let b = b\, . . . , b^ generate 
some bad path. Let i be the first point at which b{ is an invalid witness to a satisfiable query. Then 
there is a good path a = a±, . . . such that a,j = bj for j < i. Furthermore, if v a and v b are the 
vectors of verifier outputs for a and b, then not only is = Vj for j < i, but also vf = 1 while 
v\ = since the only false verifier outputs are false negatives. 



The maximum value for A(v b ) is obtained if Vj = 1 for j > i; so we have 
A(v a ) - A(v b ) > l^Ajv] + AA- [Y.A.v) + 




Now 

k k 
A i = 2 k ' i (mk + 1) = (mk + l)+ ^ 2 k ~ j {mk + 1) = (mk + 1) + ^ A j> ( 8 ) 

j=i+l J=i+1 

so Ai — Ylj=i+i Aj = mk + 1. But the ratio between the weight of b and its corresponding good 
path a is then at most 2 A ^ vh ^~ A<yVa ^ = 2~ mk ~ 1 . Since there are only 2 mk paths altogether, there 
are fewer than 2 mk bad paths; thus, the ratio between the total weight of all bad paths mapped 
to a and the weight of a is less than 2 mk 2~ mk ~ 1 = \. Summing over all good paths shows that 
the total weight of all bad paths is less than half the total weight of all good paths, so at least | 
of all non-abstaining paths are good. It follows that, conditioned on not abstaining, M' accepts 
with probability greater than | if M accepts, and accepts with probability less than | if M rejects. 
Hence L(M) G BCPP. I 

An interesting open question is where exactly BCPP lies between p Np [°( lo g™)] anc j pp. it i s 
conceivable that by cleverly exploiting the power of conditioning to amplify low-probability events 
one could show BCPP = PP. However, we will content ourselves with the much easier result of 
showing that the usual amplification technique for BPP also applies to BCPP. 

Theorem 15 If L £ BCPP, then there exists a noncommittal Turing machine M such that the 
probability that M accepts conditioned on not abstaining is at least 1 — f(n) if x £ L and at most 
f(n) if x ^ L, where n = \x\ and f(n) is any function of the form 2~°(™ c ) for some constant c > 0. 

Proof: Let membership in L be computed by M' . Assume x £ L (the case x ^ L is 
symmetric). Consider k independent executions of M' with input x; call the random variables 
representing their outputs Yi, Y2, . . . , Because the executions are independent, for any 0-1 
vector of values yi PrfViYi = yi \ ViYi ^_L] = TJ^ Pr[l^ = yt \ So conditioning on 

no abstentions, J2i Yi nas a binomial distribution with p = \ + e, and Chernoff bounds imply 
P r EjY; < I j \fiYi /_L] is exponentially small in ke. Since e is constant and we can make k 
polynomially large in n, the result follows. I 
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4.3.3 Bounded Market Prediction Is BCPP-complete 



In Section 4.3.2, we have denned the complexity class BCPP and have shown that it contains the 
powerful class p Np [ ( lo s n )]. i n this section, we show that bounded market prediction is complete 
for BCPP. In a sense, this result says that market prediction is a universal prediction problem: if 
we can predict a market, we can predict any event conditioned on past history as long as we can 
sample from an underlying discrete probability space whose size is at most exponential. 

It also says that bounded market prediction is very hard. That is, using Theorems ^ and [L4|, 
even if the next day's price is determined with all but an exponentially small probability, it cannot 
be solved in the polynomial-time hierarchy unless the hierarchy collapses to a finite level. 

Theorem 16 The bounded market prediction problem is complete for BCPP, in either the AS+FI 
or the AS+PI model. 

Proof: First we show that bounded market prediction is a member of BCPP. Given a market, 
construct a noncommittal Turing machine M whose input is the price history and strategies, and 
whose random inputs supply the settings for the population variables Aj. Let M abstain if the 
price history is inconsistent with the input and population variables; depending on the model, 
this is either a matter of checking the linear inequalities produced by Lemma |l| or the equations 
produced by Lemma ||. Otherwise, M accepts if the market rises and rejects if the market falls on 
the next day. The probability that M accepts thus equals the probability that the market rises: 
either more than 2/3 or less than 1/3. Since the problem is to distinguish between these two cases, 
M solves the problem within the definition of a BCPP-machine. 

In the other direction, we reduce from any BCPP-language L. Suppose L is accepted by some 
BCPP-machine M . We will translate M and its input x into a bounded market prediction problem. 



First use Theorem 15 to amplify the conditional probability that M accepts to either more than 2/3 
or less than 1/3 as bounded market prediction demands. Then convert M into two polynomial-size 
circuits, one computing 

f if M(x,r) =J_; 
I 1 j£M(x,r) /I, 



and the other computing 



Ci(r) 



if M(x,r) 7^1; 

1 if Af(a?,r) = 1. 



Without loss of generality we may assume that Cj_ and C\ are built from NOR gates. Applying 
Lemma ^ to each yields two sets of constraints Ay_y > and A\y > and column vectors cj_ 
and ci such that c^y > if and only if C^y = 1 and c\x > if and only if C\{x) = 1, where y 
satisfies the previous linear constraints and x is the initial prefix of y consisting of variables not 
introduced by the construction of Lemma |6[ We also have from Lemma ^ that there is a one-to- 
one correspondence between assignments of x and assignments of y satisfying the A constraints, so 
probabilities are not affected by this transformation. 

Now use Lemma § to construct a market model in which Ay_y > 0, A\y > 0, and cj_y > are 
enforced by the strategies and price history, and sign(ciy) determines the price change on the next 
day of trading. Thus the consistent settings of the variables Aj are precisely those corresponding 
to settings of r for which Cy_{r) = 1, or, in other words, those yielding computation paths that do 
not abstain. The market rises when C\{r) = 1, or when M accepts. So if we can predict whether 
the market rises or falls with conditional probability at least 2/3, we can predict the likely output 
of M. It follows that bounded market prediction for the AS+FI model is BCPP-hard. 
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To show the similar result for the AS+PI model, use Lemma to convert the constraints 
A^_y > 0, Aiy > into a system of linear equations Bz = 1, and then proceed as before, using 
Lemma |^ to convert this system to a price history and letting c±z determine the price change (and 
thus the sign of the price change) on the next day of trading. I 



4.3.4 Unbounded Market Prediction is CPP-complete 

The unbounded market prediction problem seems harder because the probability threshold in 
question is ^ with no e bound in contrast to the thresholds | and | for the bounded market 
prediction problem. The following theorem reflects this intuition. However, since we do not know 
whether BCPP is distinct from PP, we do not know whether unbounded prediction is strictly 
harder. 

Theorem 17 The unbounded market prediction problem is complete for CPP = PP, in either the 
AS+FI or the AS+PI model. 



Proof: Similar to the proof of Theorem 16. I 



5 Future Research Directions 

There are many problems left open in this paper. Below we briefly discuss some general directions 
for further research. 

We have reported a number of simulation and theoretical results for the AS model. As for 
empirical analysis, it would be of interest to fit actual market data to the model. We can then 
use the estimated parameters to (1) test whether the model has any predicative power and (2) 
test the effectiveness of new or known trading algorithms. This direction may require carefully 
choosing "realistic" strategies for 11. Besides the momentum and contrarian strategies, there are 
some popular ones which are worth considering, such as those based on support levels. Investment 
newsletters could be a useful source of such strategies. 

The AS model is an idealized one. We have chosen such simplicity as a matter of research 
methodology. It is relatively easy to design highly complicated models which can generate very 
complex market behavior. A more challenging and interesting task is to design the simplest possible 
model which can generate the desired market characteristics. For instance, a significant research 
direction would be to find the simplest model in which market prediction is computationally hard. 
On the other hand, it would be of great interest to find the most general models in which market 
prediction takes only polynomial time. For this goal, we can consider injecting more realism into 
the model by introducing resource-bounded learning (the generality of II is equivalent to unbounded 
learning), variable memory size, transaction costs, buying power, limit orders, short sell, options, 
etc. 



Acknowledgments 

This work originated with David Fischer's senior project in 1999, advised by Ming- Yang Kao. David 
would like to thank his father and role model, Professor Michael Fischer, for teaching, mentoring, 
and inspiring him throughout college. 



18 



References 

[1] P. B. Andreassen and S. Krause. Judgemental extrapolation and the salience of change. 
Journal of Forecasting, 9(4):347-372, 1990. 

[2] D. Applegate and R. Kannan. Sampling and integration of near log-concave functions. In 

Proceedings of the 23rd Annual ACM Symposium on Theory of Computing, pages 156-163, 
1991. 

[3] P. Billingsley. Probability and Measure. John Wiley and Sons, second edition, 1986. 
[4] F. Black. Noise. Journal of Finance, 41(3):529-543, 1986. 

[5] J. Y. Campbell, A. W. Lo, and A. C. MacKinlay. The Econometrics of Financial Markets. 
Princeton University Press, Princeton, NJ, 1997. 

[6] R. G. Clarke and M. Statman. Bullish or bearish? Financial Analysts Journal, May/June:63- 
72, 1998. 

[7] P. A. Cootner. The Random Character of Stock Market Prices. MIT Press, Cambridge, MA, 
1964. 

[8] R. H. Day and W. H. Huang. Bulls, bears, and market sheep. Journal of Economic Behavior 
and Organization, 14:299-330, 1990. 

[9] W. DeBondt. What do economists know about the stock market? Journal of Portfolio 
Management, Winter:84-91, 1991. 

[10] W. DeBondt. Betting on trends: Intuitive forecasts of financial risk and return. International 
Journal of Forecasting, 9:355-371, 1993. 

[11] J. B. DeLong, A. Shleifer, L. H. Summers, and R. J. Waldmann. Noise trader risk in financial 
markets. Journal of Political Economy, 98(4):703-738, 1990. 

[12] E. Fama. Efficient capital markets: II. Journal of Finance, 46:1575-1618, 1991. 

[13] I. Karatzas and S. E. Shreve. Methods of Mathematical Finance, volume 39 of Applications of 
Mathematics. Springer Verlag, New York, NY, 1998. 

[14] A. Kumar. Behavior of momentum following and contrarian market timers. Working Paper 
99-01, International Center for Finance, School of Management, Yale University, Jan. 1999. 

[15] A. W. Lo and A. C. MacKinlay. A Non-Random Walk Down the Wall Street. Princeton 
University Press, Princeton, NJ, 1999. 

[16] T. Lux. The socio-economic dynamics of speculative markets: Interacting agents, chaos, and 
the fat tails of return distributions. Journal of Economic Behavior and Organization, 33:143- 
165, 1998. 

[17] B. G. Malkiel. A Random Walk down Wall Street. Norton, New York, NY, 1990. 

[18] B. B. Mandelbrot. Fractals and scaling in finance. Springer- Verlag, New York, 1997. Discon- 
tinuity, concentration, risk, Selecta Volume E, with a foreword by R. E. Gomory. 



19 



[19] S. Mullainathan. A memory based model of bounded rationality. Working paper, Department 
of Economics, MIT, Apr. 1998. 

[20] R. Sullivan, A. Timmermann, and H. White. Data-snooping, technical trading rules and the 
bootstrap. Journal of Finance, 54:1647-1692, 1999. 



20 



